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Selection in a time-periodic environment is modeled via the two-player replicator dynamics. For 
sufficiently fast environmental changes, this is reduced to a multi-player replicator dynamics in a 
constant environment. The two-player terms correspond to the time-averaged payoffs, while the 
three and four-player terms arise from the adaptation of the morphs to their varying environment. 
Such multi-player (adaptive) terms can induce a stable polymorphism. The establishment of the 
polymorphism in partnership games [genetic selection] is accompanied by decreasing mean fitness 
of the population. 
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Environmental impact on adaptation, selection and 
evolution is an important subject of biological research 
P, 0, 0, 0, H, H, SH, 0] ■ The main scenario of environ- 
mental adaptation on the population level is polymor- 
phism [l|, [2j S 0] ■ O'' more clearly different types of 
phenotype (morph) exist in one interbreeding population. 
The basic mechanisms of polymorphism are heterozygote 
advantage and inhomogeneous (frequency, space and/or 
time-dependent) environment 3 • Polymorphism can 
be restricted to the phenotype level, or it may be con- 
trolled genetically by multiple alleles at a single locus 
, e.g., human ABO blood groups Here is one ex- 
ample of polymorphism related to a time-dependent en- 
vironment 0, [^. Forrest populations of the land snail 
Cepaea Nemoralis consist of three morphs havingrespec- 
tively brown, pink and yellow colored shells [1, (H]. The 
brown and pink morphs have an advantage over the yel- 
low morph at the spring time, since the background color 
makes them less visible for predators |^ iSj; the yellow 
morph has an advantage at summer and autumn on the 
yellow-green substrate. In addition, the yellow morph is 
more resistant to high and low temperatures 4,, 5.] . Thus 
different morphs have different advantage under different 
environmental conditions 

A varying environment has roughly three dimensions: 
it may be time or/and space dependent, predictable vs 
stochastic, and fine vs coarse-grained The latter 

means that each individual within population sees mainly 
one fixed environment, which can change from one gen- 
eration to another, for example. A fine grained environ- 
ment changes many times during the life-time of each 
individual; see the above example of Cepaea and note 
that this snail lives seven to eight years i4i, i5,] . 

Much attention was devoted to modeling polymor- 
phism in various coarse-grained environments [l|, [3, @, 



0, B @]- Fine-grained environments got less attention, 
since early theoretical results P, [l^l and the biological 
common sense [2] implied that non-trivial polymorphism 
scenarios are absent. One expects that in this case the 
organism sees (thus adapts to) the average environment 
[HM However, recent experiments indicate that the 
evolving populations can adapt to time- varying aspects 
of their fine-grained environment 
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ticular, they can respond to the environmental patterns 
other than the environmental mean [llj. Moreover, the 
total fitness during such an adaptation need not increase 
[l3| . A proper theoretical model for such phenomena is 
still absent. 

Here we present a theory for polymorphism in fine- 
grained, time-periodic environment based on Evolution- 
ary Game Theory (EGT). Our main method is the time- 
scale separation in the replicator dynamics. 

EGT describes interacting agents separated into sev- 
eral groups 0, [l^ . The reproduction of each group is 
governed by its fitness, which depends on interactions 
between the groups. The most popular replicator dy- 
namics approach to EGT describes the time-dependent 
frequency pk{t) of the group fc, which is the number of 
agents Nk in the group fc, over the total number of agents 
in all n groups: pk = Nk/J2k=i The fitness f/c of the 
group fc is a linear function of the frequencies [7l.ll4|: 



fk{a,p)^2^^^^ 



a-kiPi, 



,n. 



(1) 



where the payoffs au account for the interaction between 
(th e ag ents from) groups k and I. The replicator dynam- 
ics [14| facilitates the (relative) growth of groups with 
fitness larger than the average fitness "Y^^^iPifi'- 



Pk = Pk 



En 
i^^Pifi] = Gk[a,p]. 



(2) 
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Within game theory the groups correspond to strategies, 
while the pay-offs aki describe interaction between two 



2 



players: the probability pk of strategy k changes accord- 
ing to the average pay-ofF J^i o-kiPi receive d by one player 
in response to applying the strategy k [tI. 

There are several applications of EGT and replicator 
dynamics in biology: i) Animal (agent) contests, where 
the groups correspond to the strategies of agent's be- 
havior during the contest, while pk is the probability by 
which an agent applies the strategy k [IJ]. The actual 
mechanism by which pk changes depends on the concrete 
implementation of the model (inheritance, learning, imi- 
tation, infection, etc), ii) Selection of genes, where pk is 
the frequency of one-locus allele k in panmictic, asexual, 
diploid population, and where Uki = aik refers to the se- 
lective value of the phenotype driven by the zygote {kl) 
0. Then IHEl) are the Fisher equations for the selec- 
tion with overlapping generations in) The basic 
Lotka-Volterra equations of ecological dynamics can be 
recast in the form ([2]) and studied as replicators 14 1. 



Within the replicator approach polymorphism means a 
stable state, where two or more pk are non-zero. 

We consider a varying, but predictable environment, 
which acts on the phenotypes ma king aki periodic func- 
tions of time with period 27r/a; [tI. llSj: akiir) = akiir + 
2tt), t = uot. There are well-defined methods to decide 
to which extent a varying environment is predictable for a 
given organism [9] . The oscillating payoffs can reflect the 
fact that different morphs (alleles, strategies) are dom- 
inating at different times. Let us assume that the en- 
vironment varies fast [fine-graining]: the average change 
of the population structure over the environment period 
27r/w is small. We separate the time-dependent payoffs 
Ofci (t) into the constant part aki and the oscillating part 
akiir) with zero time-average Ofc; = j^'^ ^auir) = 0, 

auir) = aki +aki{T), drCLkiir) = auir), au = 0, (3) 

where iiki is defined to be the primitive of aa with its 
time-average equal to zero. 

Following the Kapitza method [l3| , we represent pk as 
a slowly varying part pk plus ek[p{t), t], which is smaller 
than Pk, fast oscillating on the environment time r, and 
averaging to zero: 



Pk{t)=Pk{t) + ek[p{t),T], 



^k[p{t)] 



dr 
2^ 



ek[p{t),T]=Q. 



(4) 
(5) 



Here the average is taken over the fast time t for a fixed 
slow time t. Note that the fast depends on the slow p. 
Now put (g]) into (O and expand the RHS of ^ over e: 

d^ + Oie^)]Gk[a(T),pU6) 

where the summation over the repeated Greek indices, is 
assumed and daX = -^-X. The fast factor is searched 

for via expanding over i: = ^ek,i + -^^k,2 -f . . .. Sub- 
stitute this into ([S]), add and subtract suitable averages. 
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FIG. 1: Schematic portrait of (|15|l for C = [upper dia- 
gram] and for C satisfying conditions (|17|l [lower diagram]. 
Stable [unstable] fixed points are denoted by circles [squares] . 
Arrows indicate direction of flow in time. 



and get for the fast terms: drEk.i = Gk[a{T),p] + 0{^). 
After straightforward integration, we have 

ekm,r] = -Gk[air),p{t)] + Oi^), (7) 

where a ~ {aki} is defined in ([3]). Once ([7]) is separated, 
the remainder in ^ is the evolution of slow terms 



Pk = Gk[a,p] + ea[p,T] daGk[a{T),p] + 0(l/a.'), (8) 

where the time-average is defined as in ([S]). Working out 
([8|) and using db = —ab we get again a replicator equation 

Pk = Pk [:Fk{p) - pa^FaiP)] , (9) 

J^kiP) = dkaPa + bkapPaPp + Ckap-iPaPpP-y (10) 

where Tk is the effective (already non-linear) fitness, and 



bklm = — aki dim, 

OJ 



Cklmn — aki amn- 
OJ 



(11) 



Here we defined ajfj = ^{aki + aik). Equations ^ [TU|) 
are our central result. Expectedly, the fast environment 
contributes the averaged payoffs du into J^k ■ This is well- 
known for any fine-grained environment [ij. 

However, besides this averaged two-party interaction, 
each group k gets engaged into three- and four-party in- 
teractions with payoffs bkim and Ckimn, respectively. In- 
deed, recalling our discussion after we can interpret 
J2im ^kimpipm in (HO]) as the average pay-off received by 
one of three players upon applying strategy k. The terms 
with bklm and Ckimn ™ l|10P exist due to adaptation of 
the morphs to their environment: while the frequencies 
of the morphs fast oscillate on the environmental time, 
see (UJ, on longer times the population sees the fast en- 
vironment as an effective many-player model. Note that 
the terms with bum and cumn need not be small as com- 
pared to (Zfez -terms, since the derivation of (|9HTT|) applies 
for pk ^ Cfc, which can hold even for dki — > 0. 

We get bklm = Ckimn = [due to dkidki = 0] , if only one 
aki varies in time, or if all dki oscillate at one phase: 'dki = 
a{t)^kh where S^ki are constant amplitudes. Indeed, the 
adaptive terms [with bkim and Ckimn] are non-zero due to 
interference between the environmental oscillations of a^; 
and those of e^, which are delayed over the environmental 
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oscillations by phase 7r/2; see ([T]). This is why there is no 
interference if all aki oscillate in phase. Thus for having 
the adaptive terms we need at least two morphs reacting 
on the environment differently ^ e.g., due to different delay 
times of reaction. For the adaptive terms we also need 
the frequency-dependent selection, e.g., they are absent 
for haploid replicators pk = Pk[ak{t) — o,a{t)pa\. 

Euation ([9]) implies that the relative growth of two 
morphs at slow (long) times is determined by the ef- 
fective fitness difference: ^{pk/pi) = {Pk/pi){^k - ^i)- 
Thus in the stable fixed points of (O the (effective) fit- 
ness of surviving morphs are equal to each other, while 
the fitness of non-surviving {pk —>■ 0) morphs is smaller 
(Nash equilibrium) [141] . Another pertinent quantity, the 
(fast) time-averaged fitness fk[p{t)] is the cumulative ef- 
fect of the short-time replication intensities. Employing 
(|31 O [7]) we deduce that even when the adaptive terms 
are taken into account, these two quantities are equal 
^k = fk{a + a,p + e). The overall long-time fitness of the 
population is characterized by the effective mean fitness 
Pa^a, which is also equal to its time-averaged analog: 



^ = [Pa + £a)iaaf3 + aaf3){p[3 + efj) ^ Pa^a 

= aappaPp + bap-yPaPlSP-r- (12) 

The contribution Cap^sPaPpp-fPs in PaJ-a nullifies due to 

(fTTj) and ab — ~ab. 

The mean fitness $ is especially important for part- 
nership games [genetic selection]: Uikir) — akiir), since 
for the constant payoff situation in the replicator equa- 
tion ([2]), $ monotonically increases towards its nearest 
local maximum over the set of variables pk (fundamen- 
tal theorem of natural selection) 0, [3l • As follows from 
(fTTI [T2 |l . for Uikir) = Ukiir), $ reduces to the averaged 
two-player contribution: ^ — J2a0PaO,af3Pi3- However, 
the theorem is not valid in the presence of the adaptive 
terms, and the mean fitness can decrease; see below. 

Let us now study concrete examples. For n = 2, Eq. ^ 
simplifies to a closed equation for the frequency pi 

Pi^Pi{l-pi)[A{t)-B{t)pi], (13) 
A{t) EE ai2(t) - a22(t), B{t) = 2ai2(t) - aii(t) - 022(0: 

where without much loss of generality we adopted 
ai2(T) = 021 (r). Using notations (0]) and defining 

1 



C = - AB ^ — [ai2 (022 - an) + 0,22 an], (14) 

Ll) UJ 

for the adaptive factor, we deduce from ^ \8\ [T3|) 

Pi^Piil~Pi)[A~Bp,-Cpi{l-pi)]. (15) 

C vanishes for the symmetric homozygotes, aii(r) = 
022 (t), and for one recessive allele, e.g. ai2(T) — a22(T). 
Both cases are easily solvable from showing that the 
long-time behavior of pi is indeed governed by A. 
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FIG. 2: The portrait of UHl for di2 = 0.1, ais = 0.1, 
a23 ~ —0.1, Ki = —0.65 and K2 = 0.65; pi and p2 are re- 
stricted by < pi < 1, < P2 < 1, Pi + P2 < 1. Two 
fixed points are denoted by square (saddle) and cycle (cen- 
ter). The closed orbits contain the center in their interior; 
orbits from the second class converge to pi = 1. These two 
classes of orbits are separated by a dashed curve (separatrix) , 
which is made by joining together two unstable directions of 
the saddle. Arrows indicate direction of flow. 

The vertices pi — 1 and pi — are always fixed points 
of (|15p . while two interior fixed points are 



^1,2 ^—[B + Ct^J{B + C)^-4AC ], TTi < 712.(16) 

If TTi and 7r2 are in (0, 1), then tti {112) is stable (unstable). 
The analysis of (jlSp reduces to the following scenarios. 

1. For A > and A > B [i.e., an > ai2 > 022] the 
morph 1 globally dominates for C = 0, i.e., for all initial 
conditions pi goes to 1 for large times; see Fig. [T] The 
global dominance does not change for C < 0. One can 
call this morph generalist since its fitness does not 
oscillate in time (e^ in ([7]) is zero for pi — 1), and its 
fitness is maximal; see also below. For 



C > B >0 and {B + Cf > AAC, 



(17) 



both TTi and 7r2 fall into the interval (0,1), see Fig. [1] 
while if one of conditions pT|) does not hold, both tti 
and 7r2 are not in this interval. Thus if ([T7]) hold, a stable 
fixed point tti emerges, which attracts all the trajectories 
that start from pi{0) < 1:2'- the polymorphism is created 
by the adaptation term oc C in (|15p . Initial condition 
larger than the unstable fixed point ■K2^ Pi(0) > 7r2, still 
tend to pi = 1; see Fig. [TJ Both stable fixed points 
TTi and 1 are Evolutionary Stable States (ESS), mean- 
ing that they cannot be invaded by a sufficiently small 
mutant population [13] . The coexistence of two ESS one 
of which is interior (i.e., polymorphic) is impossible for 
a two-player replicator equation with constant pay-offs 
14| . but it is possible for multi-player replicator equa- 
tion [1^ . We thus saw above an example of this behavior 
induced by time- varying environment. 

As we discussed below (fT2|) . the mean fitness does not 
contain the adaptive terms directly and is given as $ = 
2Api — Bpi (up to an irrelevant constant). For A > 
and j4 > B, $ maximizes at pi = 1, and this maximum 
is the only stable fixed point of the replicator dynamics 
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(fT5| with C = 0. If however C satisfies conditions ([T7)) . 
in the stable fixed point pi = tti the mean fitness $ is 
smaller than at the stable point pi — 1. Moreover, for 
the initial conditions tti < Pi(0) < 7^2, the mean fitness 
<i> decreases in the course of the relaxation to tti . 
The quantity which is increased by dynamics ([TS]) is 

* = 2Api Bp\ - Cplil - fi) = 'i' - Cplil - 

Thus is the Lyapunov function: > 0. Though 
— $ < whenever the adaptive polymorphism con- 
ditions ((T7]) hold, the difference between the Lyapunov 
function and the mean fitness is not negative for C < 0. 
Compare these facts to the evolution in a coarse-grained 
environment, which tends to maximize the average fitness 
minus a positive risk aversion factor; see [l^ for reviews. 

2. For B > A > [i.e., ai2 > an, 022] and C = there 
is a stable polymorphism at the fixed point pi ~ A/B 
(heterozygote advantage) . The presence of C 7^ in (fT5|) 
does not change this polymorphism; only the value of the 
fixed point shifts to tti . In contrast to the scenario 1, here 
the response to slow environmental changes is reversible. 

3. For B < A < Q and C = there is an unstable 
polymorphism: all the initial conditions with pi(0) < 
A/B end up at pi = (morph 2 dominates), while those 
with pi(0) > A/B finish at pi = 1 (morph 1 dominates). 
Now C 7^ in (fT5|) shifts the unstable fixed point to ^2- 

These are all possible scenarios for n ~ 2; other rela- 
tions between A and B lead to interchanging morphs. 

For three morphs, n = 3, we assume the zero-sum sit- 
uation in (l2|), akiir) = —aikir) [19]: the loss of the strat- 
egy I is equal to the gain of k. Equations ((9HTT|) reduce 
to 

Pi = PiaiaPa + H%PlP2pZ, Kl = 25i23, ^2 = 26213, (18) 

where '^i ~ 0. The four-party contribution disap- 

pears from (|18p . One can show that any interior fixed 
point of (HH]) can be either saddle (two real eigenvalues 
of the Jacobian with different sign) or center (two imag- 
inary, complex conjugate eigenvalues). 

4. For 0,12 > 0, ai3 > and = the morph 1 
globally dominates: pi — l\s the only stable fixed point. 
For the existence of polymorphism it is necessary that 
Ki < 0, i.e., the strategies 2 and 3 together win over 1, 
although separately they lose to 1. The dominance of 1 is 
still kept when p2 or p3 are forced to decay, because then 
the adaptive term in (fT8| is irrelevant for sufficiently large 
times. This happens when 023 < and K2 < or when 
023 > and K3 < 0. Apart from these cases the terms oc 
Ki in (jl8[) can lead to polymorphism, provided that their 
magnitude is large enough; see Fig. [2l Besides the stable 
fixed point pi = 1 of the = dynamics, two new fixed 
points emerge: stable (center) and unstable (saddle). A 
domain around the saddle supports polymorphism with 
cyclic dominance of the morphs; see Fig. [21 We see a 



general feature of all the above examples: the adaptive 
(multi-party) terms do not infiuence the local stability of 
the vertices (where all but one pk 's are zero) . 

5. For ai2 > 0, 013 < 0, 023 > there is polymor- 
phism already for = 0: 1 wins over 2, which wins over 
3, but 3 wins over 1 (rock-scissor-paper game). Now for 
Ki ^ in (flS)) there is already one interior fixed point, 
and the trajectories are closed orbits around this fixed 
point. After including the adaptive (oc k^) terms in (|18p 
this fixed point is simply shifted, and no new fixed points 
appear for any size or magnitude of Ki . 

To summarize, we have shown that in addition to the 
averaged payoffs, a fast [fine-grained] time-periodic en- 
vironment generates adaptive, multi-player terms in the 
replicator dynamics, provided that at least two morphs 
react on the environment differently. These terms can 
create a polymorphic stable state via adaptation of the 
"weak" morphs to environmental changes. This polymor- 
phism is related to decreasing mean fitness of the pop- 
ulation. This specific aspect of the polymorphism was 
argued to be a prerequisite for the phenomenon of sym- 
patric speciation, where by contrast to the allopatric sce- 
nario the speciation is induced inside a single population 
[2o| . Thus, our results hint at a sympatric speciation sce- 
nario due to a fine-grained, time-periodic environment. 
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